Implement Flip transforms with CVCUDA backend #9277
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9277
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV. If your PR is affected, please view it below.
❌ 9 New Failures, 1 Cancelled Job, 3 Unrelated Failures as of commit 98616f4 with merge base 617079d.
NEW FAILURES: the following jobs have failed.
CANCELLED JOB: the following job was cancelled. Please retry.
FLAKY: the following jobs failed but were likely due to flakiness present on trunk.
BROKEN TRUNK: the following job failed but was also present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from 9cb272b to 02c320a.
justincdavis: @zy1git What is the strategy for creating the tests for the transforms with CV-CUDA backends? Do we want all the tests to live entirely inside the existing classes, or should we make a new class? The PRs I made for gaussian_blur, normalize, and to_dtype all use new classes, but I can switch to a more centralized approach.
Force-pushed from 02c320a to 330db00.
AntoineSimoulin left a comment:
Thanks a lot for submitting this PR! This is looking good. I added some comments to make sure we have extensive test coverage. :)
NicolasHug: @justincdavis replying to your question in #9277 (comment): we prefer centralizing the tests in the existing test class. The idea is that, as much as possible, we'd just add CV-CUDA as a parametrization entry with …
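As a rough sketch of what such a centralized parametrization could look like (the make_image, make_image_pil, and make_image_cvcuda helper names are assumptions mirroring torchvision's test utilities, not something confirmed in this thread):

import pytest
import torchvision.transforms.v2.functional as F

try:
    import cvcuda  # noqa: F401
    CVCUDA_AVAILABLE = True
except ImportError:
    CVCUDA_AVAILABLE = False


@pytest.mark.parametrize(
    "make_input",
    [
        make_image,      # plain torch.Tensor image (assumed helper)
        make_image_pil,  # PIL image (assumed helper)
        pytest.param(
            make_image_cvcuda,  # cvcuda.Tensor image (assumed helper)
            marks=pytest.mark.skipif(not CVCUDA_AVAILABLE, reason="CV-CUDA not installed"),
        ),
    ],
)
def test_horizontal_flip_smoke(make_input):
    image = make_input()
    assert F.horizontal_flip(image) is not None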
@NicolasHug Makes sense! I will follow the comments you and Antoine left on this PR.
Force-pushed (… according to the comments) from 330db00 to 98616f4.
Thanks a lot for addressing the initial comments! I left some final adjustments to make. Let's also make sure linting and tests are passing!
if isinstance(image, cvcuda.Tensor):
    # For CVCUDA input
    expected = F.vertical_flip(F.cvcuda_to_tensor(image))
    assert_equal(F.cvcuda_to_tensor(actual), expected)
else:
    # For PIL/regular image input
    expected = F.to_image(F.vertical_flip(F.to_pil_image(image)))
    assert_equal(actual, expected)
Let's:
- remove the first call to F.cvcuda_to_tensor(image) so that the flip operation is done on the cvcuda tensor
- simplify with a single assert_equal:

if isinstance(image, cvcuda.Tensor):
    expected = F.cvcuda_to_tensor(F.vertical_flip(image))
else:
    expected = F.to_image(F.vertical_flip(F.to_pil_image(image)))
assert_equal(actual, expected)
See this comment discussion: #9277 (comment)
if isinstance(image, cvcuda.Tensor):
    # For CVCUDA input
    expected = F.horizontal_flip(F.cvcuda_to_tensor(image))
    print("actual is ", F.cvcuda_to_tensor(actual))
    print("expected is ", expected)
    assert_equal(F.cvcuda_to_tensor(actual), expected)
else:
    # For PIL/regular image input
    expected = F.to_image(F.horizontal_flip(F.to_pil_image(image)))
    assert_equal(actual, expected)
Let's:
- remove the print statements
- remove the first call to F.cvcuda_to_tensor(image) so that the flip operation is done on the cvcuda tensor
- simplify with a single assert_equal:

if isinstance(image, cvcuda.Tensor):
    expected = F.cvcuda_to_tensor(F.horizontal_flip(image))
else:
    expected = F.to_image(F.horizontal_flip(F.to_pil_image(image)))
assert_equal(actual, expected)
Hi @AntoineSimoulin,
Thanks a lot for the comment. After taking a closer look, I think my implementation actually works well.
1. For CV-CUDA input, actual = fn(image) is a cvcuda tensor, but assert_equal cannot handle cvcuda tensors, so your suggestion raises: "E TypeError: No comparison pair was able to handle inputs of type <class 'nvcv.Tensor'> and <class 'nvcv.Tensor'>." I therefore convert actual to a torch tensor and compare with assert_equal(F.cvcuda_to_tensor(actual), expected). (Did we add cvcuda tensor comparison to assert_equal? If so, I can make this part consistent with the PIL implementation.)
2. As for expected = F.horizontal_flip(F.cvcuda_to_tensor(image)), the logic is to convert the cvcuda tensor to a torch tensor first and then apply the flip to get the expected result, just like the PIL path F.horizontal_flip(F.to_pil_image(image)). The PIL path uses expected = F.to_image(F.horizontal_flip(F.to_pil_image(image))) because assert_equal can handle PIL image comparisons.
Please let me know if I understand correctly.
I will definitely remove the print statement and the unnecessary comments.
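For reference, a minimal sketch of the comparison flow described above; assert_equal is assumed to be the helper from torchvision's test utilities, and image is assumed to be a cvcuda.Tensor:

import torchvision.transforms.v2.functional as F

def check_horizontal_flip_cvcuda(image, assert_equal):
    # The registered CV-CUDA kernel returns a cvcuda.Tensor.
    actual = F.horizontal_flip(image)
    # Reference result computed on the torch copy of the same image.
    expected = F.horizontal_flip(F.cvcuda_to_tensor(image))
    # assert_equal has no comparison pair for nvcv.Tensor, so convert back first.
    assert_equal(F.cvcuda_to_tensor(actual), expected)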
if CVCUDA_AVAILABLE:
    _transformed_types = (torch.Tensor, PIL.Image.Image, cvcuda.Tensor)
I feel we should maybe add cvcuda.Tensor to _transformed_types in the base class Transform under "vision/torchvision/transforms/v2/_transform.py". This way, we don't have to add it every time. Maybe something like:

# Class attribute defining transformed types. Other types are passed-through without any transformation
# We support both Types and callables that are able to do further checks on the type of the input.
_transformed_types: tuple[type | Callable[[Any], bool], ...] = (torch.Tensor, PIL.Image.Image)
if CVCUDA_AVAILABLE:
    _transformed_types += (cvcuda.Tensor,)
Regarding this, I have run into this issue on the other transform PRs. To solve it, I added an is_cvcuda_tensor function and added it to _transformed_types and to the query_size function. What is the preferred method for handling this, the tuple setup or using a function?
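For illustration, a rough sketch of the predicate-based option; the helper name and module layout are assumptions, while the callable support of _transformed_types is taken from the comment in the snippet above:

import PIL.Image
import torch

try:
    import cvcuda
    CVCUDA_AVAILABLE = True
except ImportError:
    CVCUDA_AVAILABLE = False

def is_cvcuda_tensor(obj):
    # Callable check so cvcuda.Tensor is only referenced when CV-CUDA is importable.
    return CVCUDA_AVAILABLE and isinstance(obj, cvcuda.Tensor)

# _transformed_types accepts both types and callables that perform further checks.
_transformed_types = (torch.Tensor, PIL.Image.Image, is_cvcuda_tensor)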
def test_image_correctness(self, fn, make_input):
    image = make_input()
    actual = fn(image)
    if isinstance(image, cvcuda.Tensor):
you can dispatch based on the make_input function itself to avoid issues if CV-CUDA is not available
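A sketch of what that dispatch could look like; make_image_cvcuda is a placeholder name for the CV-CUDA input factory, and F and assert_equal are the same helpers used elsewhere in the tests:

def test_image_correctness(self, fn, make_input):
    image = make_input()
    actual = fn(image)
    # Dispatch on the factory itself so cvcuda.Tensor is never referenced
    # when CV-CUDA is not installed.
    if make_input is make_image_cvcuda:
        expected = fn(F.cvcuda_to_tensor(image))
        assert_equal(F.cvcuda_to_tensor(actual), expected)
    else:
        expected = F.to_image(fn(F.to_pil_image(image)))
        assert_equal(actual, expected)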
Summary:
Implemented _horizontal_flip_image_cvcuda and _vertical_flip_image_cvcuda kernels using the cvcuda.flip operator. The kernels are automatically registered when CVCUDA is available and route cvcuda.Tensor inputs appropriately.
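As a rough illustration of that approach (not the exact code in this PR): the flipCode values follow the OpenCV-style convention that cvcuda.flip uses, and _register_kernel_internal is assumed to be torchvision's internal kernel-registration decorator.

import cvcuda
from torchvision.transforms.v2.functional import horizontal_flip, vertical_flip
from torchvision.transforms.v2.functional._utils import _register_kernel_internal

@_register_kernel_internal(horizontal_flip, cvcuda.Tensor)
def _horizontal_flip_image_cvcuda(image: cvcuda.Tensor) -> cvcuda.Tensor:
    # flipCode=1 mirrors the image left-right (horizontal flip).
    return cvcuda.flip(image, flipCode=1)

@_register_kernel_internal(vertical_flip, cvcuda.Tensor)
def _vertical_flip_image_cvcuda(image: cvcuda.Tensor) -> cvcuda.Tensor:
    # flipCode=0 mirrors the image top-bottom (vertical flip).
    return cvcuda.flip(image, flipCode=0)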
Test Plan: